Name | Version | Summary | date |
nncf |
2.14.1 |
Neural Networks Compression Framework |
2024-12-19 12:29:25 |
llmcompressor |
0.3.1 |
A library for compressing large language models utilizing the latest techniques and research in the field for both training aware and post training techniques. The library is designed to be flexible and easy to use on top of PyTorch and HuggingFace Transformers, allowing for quick experimentation. |
2024-12-12 13:05:51 |
llmcompressor-nightly |
0.3.0.20241112 |
A library for compressing large language models utilizing the latest techniques and research in the field for both training aware and post training techniques. The library is designed to be flexible and easy to use on top of PyTorch and HuggingFace Transformers, allowing for quick experimentation. |
2024-11-12 16:42:51 |
deepsparse-ent |
1.8.0 |
An inference runtime offering GPU-class performance on CPUs and APIs to integrate ML into your application |
2024-07-19 16:32:32 |
deepsparse |
1.8.0 |
An inference runtime offering GPU-class performance on CPUs and APIs to integrate ML into your application |
2024-07-19 16:29:01 |
sparsezoo |
1.8.1 |
Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes |
2024-07-19 16:25:22 |
sparseml-nightly |
1.8.0.20240630 |
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models |
2024-06-30 20:38:38 |
sparseml |
1.8.0 |
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models |
2024-06-21 15:53:30 |
deepsparse-nightly |
1.8.0.20240502 |
An inference runtime offering GPU-class performance on CPUs and APIs to integrate ML into your application |
2024-05-06 19:42:19 |
sparsezoo-nightly |
1.8.0.20240506 |
Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes |
2024-05-06 19:37:51 |